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SOME  GRAPHICAL  CONSIDERATIONS  IN  TIME  SERIES  ANALYSIS 

by  Benjamin  Kedem 

ABSTRACT 

The  pictorial  information  in  a  stationary  time  series  as 
depicted  by  crossings  of  levels  and  crossings  of  random  levels 
and  related  quantities  is  studied.  It  is  shown  that  such  graphi¬ 
cal  features  are  directly  connected  with  the  covariance  function 
and  hence  with  the  spectral  density.  Many  of  these  features  can 
be  actually  applied  in  estimation  and  in  the  study  of  extremes. 

In  the  Gaussian  case,  the  finite  dimensional  distributions  are 
completely  determined  by  the  axis  crossings  and  by  the  crossings 
of  a  random  curve  (to  be  defined)  if  the  process  is  essentially 
bounded. 
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SOME  GRAPHICAL  CONSIDERATIONS  IN  TIME  SERIES  ANALYSIS 


by  Benjamin  Kedem 


1.  INTRODUCTION 

From  the  review  articles  by  Beniger  and  Robyn  (1978)  and 
Fienberg  (1979)  we  learn  that  graphical  methods  for  depiction  of 
empirical  data  is  an  old  and  useful  idea  which  may  be  traced 
hundreds  of  years  back.  However,  no  theory  of  statistical  graphics 
exists  although  graphical  depiction  of  data  is  a  well  accepted 
practice.  Concerning  this  last  remark,  this  is  even  so  when  the 
data  at  hand  are  graphical  by  their  very  nature,  and  the  case  we 
have  in  mind  is  time  series  analysis.  True,  theory  as  such  does 
not  exist  but  still  we  may  advance  graphical  methods  of  analysis. 
The  point  to  be  made  here  is  that  graphical  methods  of  analysis 
are  almost  always  available  given  a  long  time  series. 

In  this  paper  we  shall  demonstrate  the  usefulness  of  some 
graphical  aspects  in  time  series  analysis  by  answering  questions 
such  as: 

1.  What  does  it  mean  that  a  time  series  appears  to  be  bounded? 

2.  How  can  we  interpret  the  number  of  crossings  of  fixed  and 
random  levels? 

3.  What  can  we  learn  from  the  succession  of  times  spent  by  a 
series  above  and  below  a  random  level? 

4.  Are  there  useful  graphical  features  to  be  reckoned  with  in 
time  series  data? 

5.  For  very  high  levels,  what  should  we  observe  in  a  time  series 

■  idei  to  determine  the  distribution  of  extremes? 
u  A it  the  axis  crossings  by  the  successive  differences  of  a  time 

series  useful  and  what  happens  if  we  difference  indefinitely? 
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In  fact  we  will  show  that  the  spectral  density  of  a  stationary 
process  is  a  function  of  some  graphical  features  which  can  be  de¬ 
tected  by  a  quick  eye  examination.  This  means  that  at  least  in 
principle  spectral  analysis  amounts  to  a  careful  eye  examination 
of  time  series  graphs.  This  is  independent  of  the  Gaussian  assump¬ 
tion.  For  a  zero-mean  stationary  Gaussian  process  we  can  even  say 
much  more:  all  of  the  finite  dimensional  distributions  can,  at 
least  in  principle,  be  determined  from  graphical  features,  under 
some  conditions  often  met  in  practice. 

Our  main  concern  in  this  paper  is  to  express  well-known 
quantities  in  terms  of  graphical  features  thus  emphasizing  the 
pictorial  information  contained  in  the  plot  of  a  stationary  time 
series.  For  this  purpose  we  shall  sometimes  create  features  by 
introducing  useful  random  curves  and  by  clipping  the  series  at 
various  levels.  In  what  follows,  axis  crossings  and  crossings  of 
"random  curves"  and  other  similar  visual  features  play  a  dominant 


role. 
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2.  RANDOM  CLIPPING 

For  a  0-mean  Gaussian  process  it  is  well-known  that  its  cor¬ 
relation  structure  is  a  function  of  the  "zeros"  of  its  sample 
paths.  Much  the  same  can  be  achieved,  at  least  in  theory,  for 
any  bounded  series. 

Let  {Zt , t=0 ,±1, . . . }  be  a  zero  mean  strictly  stationary 
process.  This  is  a  strong  assumption  which  is  made  here  for 
simplicity.  Assume  that  there  exists  an  A  such  that 


|Zt|  <  A.  (2.1) 

Then  the  process  is  also  weakly  stationary  of  order  k  for  any 
k.  In  the  range  (-A,A)  we  define  a  uniform  process 
{Ut,t=0,±l,...}  independent  of  Zt  and  made  of  independent  and 
identically  distributed  random  variables,  such  that 


(u) 


2^  ,  -A<u<A 
0  ,  otherwise. 


(2.2) 


Thus,  corresponding  to  a  time  series  Z^,...,Z^  there  is  a  "random 
curve"  U-^,...,Uj,  which  helps  us  to  define  a  clipped  binary  series 
(V  by 


1  ’  zt~Ut 


0,  Zt<Ut 


,  t-1 , . . . ,N. 


(2.3) 


■p . . . ,U^  will  be  referred  to  as  the  U-curve 


The  time  series  U 
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and  throughout  the  paper  it  is  assumed  that  N  is  large.  Observe 


that  EY 


t  =  j  and  for  t^t..  ,i,  j€(l, . .  .  ,N) 


EY,  ... Y  =  P  (Z^  >lh  >lk  ) 

tl  tr  r  H  *1  tr 


(2.4) 


—  E(Z  +A). . . (Z.  +A) 
(2A)r  xl  tr 


so  that  all  the  cross  moments  of  {Z^}  can  be  obtained  from  the 
moments  of  {Y^}.  In  particular  from  (2.4)  we  obtain  an  important 
relation  between  the  covariance  functions 


y7 (k)  =  4A  Yv(k)  ,  k*0,  k=l ,  2,  ... 


(2.5) 


and  so  the  covariances  in  the  {Y^}  process  are  reduced,  which  for 
binary  data  amounts  to  a  lesser  dependence. 

Now  consider  the  quantity 


ci  =  2jk  -  2|vt-i  -  <w 


(2.6) 


This  is  readily  seen  to  be  the  number  of  symbol  changes  in  the  bi¬ 
nary  series.  But  then  it  is  also  the  number  of  "curve  crossings" 
where  the  curve  is  the  U-curve  defined  as  the  random  series 
(2.5)  and  (2.6)  lead  to 


J  2E(C  ) 
Yz(l)  =  A  1 


(2.7) 


We  see  that  Y^( D  is  a  linear  function  of  the  expected  number  ol 
U-curve  crossings  by  Z^,...,Z^.  An  unbiased  estimate  of  y2(1)  is 
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Y,<1>  =  A2  1  -  mTT 


N-l  • 


(2.8) 


A 

We  can  get  hold  of  an  approximation  to  the  variance  of  y^d) 
as  follows.  Let  B  >  A  and  assume  is  uniform  in  (-B,B) 

while  (2.1)  still  holds.  Then  in  (2.4)  B  replaces  A  and  as 


B  increases 


Pr( Y  =1,...  , Y  =1)  -►  (i)1 
T1  r  z 


in  which  case  (Kedem  (1980a))  we  easily  see  that  C-^-»-b(N-l,i) 


Var( y„ (1) ) 


B 

N-l  * 


(2.9) 


The  lesson  to  be  learned  here  is  two  fold.  First,  we  have  made 
use  of  a  conspicuous  graphical  feature  and  second,  by  employing 
an  artificially  large  upper  bound  B  on  |Z  |  in  the  definition 
of  Ut ,  the  clipped  data  are  nearly  independent.  This  is  a  de¬ 
sirable  property  of  random  clipping. 

Next  define 


N  N 

c2  =  2prt  -  2jvt_2  -  2' 


(2.10) 


Then 


2E(C,) 

YZ ( 2 )  =  A  1  "  N-2 


(2.11) 
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But  apart  from  a  negligible  end  effect,  is  twice  the  number 

of  successive  sojourns  above  and  below  the  U-curve  of  at  least  two 
time  periods.  That  is,  twice  the  number  of  runs  in  the  clipped 
binary  series  between  the  first  and  last  l's  with  at  least  two 
symbols  plus  1.  We  can  continue  in  this  fashion.  Y^Y  ^ 

also  defines  the  number  of  certain  sojourns  but  it  is  simpler  to 
switch  now  the  eye  to  the  binary  series  between  the  first  and  last 
l's.  Then 

N  N 

£y ,-jY  Y  q  =  #  of  runs  with  at  least  3  symbols  +  #  of  0011 
1  T  4  t  t-d 

+  #  of  0101  +  2. 

For  example  in  the  binary  series 

00010100010101000011110101001111001 
ZYt  -  EYtYt_3  =  16  -  5  =  11. 

But  the  number  of  runs  with  3  or  more  symbols  (concentrating  on  the 
series  between  the  first  and  last  l's)  is  4,  the  number  of  0011 
is  2  and  the  number  of  0101  is  3.  The  sum  of  these  numbers  plus 
2  is  11  as  it  should.  We  can  now  define  in  terms  of  these 

features  and  then  express  y^O)  in  terms  of  ECC^).  In  general, 
we  define 

N  N 

Ck  =  2lY  -  2  l  Y  Y  -  k  ,  k=  2 , 3 , . . .  (2.12) 

k  1  r  k+1  t  X  * 

where  is  obtained  by  counting  the  number  of  times  (Z^f-lJ^) 

.nd  the  number  of  times  ^zt“^t ,Zt-k2^t-k^ * 
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And  by  replacing 
estimate.  If 


Yz(k) 


A 


2E(Ck)' 

N-k 


( 


ECC^)  by  in  (2.13)  we  obtain  an  unbiased 

is  defined  so  that 


E(C0) 


(0) 


5 


(2.13)  holds  for  k=0,±l,...,  and  assuming  that  a  spectral 
density  f(A)  exists  we  have  for  sufficiently  large  M  and  N 


f(X) 


=  —  1 
2irk=-~ 


ikA  2 
e  A 


2E(C.  ) 
k 

N-k 


«  A 


sin(H+i) A 


2Trsin^A 


1  M  ..  . 

-  I  e-lkiE(C.) 
wN  k=-M  K 


( 


Whence  for  bounded  stationary  processes  the  spectral  density  is 
essentially  the  Fourier  transform  of  the  expected  numbers  of 
vi: ual  features  such  as  U-curve  crossings  and  lengths  of  sojourns 
above  and  below  the  U-curve. 


2.13) 


2.14) 


i 
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3.  THE  GAUSSIAN  CASE  AND  LOSS  OF  INFORMATION 

How  much  information  is  lost  due  to  clipping?  This  is  a 
disturbing  problem  associated  with  hard  limiting  operations.  In 
the  Gaussian  case  however,  at  least  in  principle,  the  answer  is 
that  no  information  is  lost  if  clipping  and  random  clipping  are 
combined.  That  is,  our  graphical  features  contain  a  great  deal 
of  information. 

To  show  this  choose  an  A  so  large  relative  to  Var(Z^) 
that  (2.1)  holds  for  all  practical  purposes.  Next  define 


X 


t 


1,  zt>0 

,  t= 0  >  ±1 , . . . > 

0,  Zt<0 


(3.15) 


where  { Z^_ }  is  a  zero-mean  stationary  Gaussian  process.  Then  it 
is  well-known  that 


Yz(k)  =  y^(  0)  sin(  27TYx(k) )  (3.16) 

and  by  invoking  (2.5) 

4A  y  (k) 

YZ(0)  =  sin(2TTYx(k)')  »  k=1>2>---  (3-17 

If  counts  the  number  of  axis-crossings  by  Z^,...,Z^,  then 

for  k=l  we  obtain  from  (2.7)  and  (2.17)  after  representing  D-j^ 
as  in  (2.6),  but  with  Y's  replaced  by  X's, 
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V0) 


Cos 


2E(CX )  -I 

N-l 

ttE(D1) 

N-l 


(3.18) 


This  means  that  the  variance  of  a  Gaussian  process  can  be  obtained 
by  observing  rhe  U-curve-and  axis-crossings  by  a  time  series  from 
the  process.  Therefore  from  (2.13)  and  (3.18),  for  a  bounded, 
practically  speaking,  zero-mean  stationary  Gaussian  process,  the 
finite  dimensional  distributions  are  completely  determined  graphically. 
To  make  this  statement  more  precise  we  should  attach  a  probabilistic 
statement  to  the  bounds,  require  that  long  records  be  available 
and  that  the  covariance  function  decays  fast  enough,  in  which  case 
(3.18)  and  (2.13)  provide  consistent  estimates  when  the  expectations 
are  replaced  by  observed  values.  This  finding  helps  to  explain  the 
remarkable  fact  that  a  great  deal  of  inference  about  a  stationary 
Gaussian  process  can  be  made  from  clipped  data  (Kedem  (1980a)). 
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4.  HIGHER  ORDER  CROSSINGS 

Can  we  get  a  formula  equivalent  to  (3.18)  in  the  general 
case  for  any  bounded  stationary  process?  The  answer  to  this 
question  is  in  the  affirmative  if  we  are  willing  to  consider  the 
U^^-curve,  say,  crossings  by  VZ^.  =  Z^-Z^  .  Assume  the  same 

properties  for  {Z^}  as  an  Previous  section  so  that 

|VZ  |  <  2A.  (4.1) 

Let  (U^15 }  be  the  uniform  independent  process  corresponding  to 
{VZ^}  so  that  is  uniform  in  (-2A,2A).  Then  from  (2.7) 


Jj , . . . ,UN,  and  the  number  of  crossings  by  VZ^y...,VZ^  of 


,U 


1 
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We  have  seen  that  the  IT  -curve  crossings  by  are  useful 

(k) 

and  a  natural  question  arises  as  to  how  useful  the  U  -curve,  say, 
crossings  by  V  Z^  are.  To  answer  this  question  note  that 


|VkZt|  2  2kA  ,  k=  0 , 1 , 2 , . . . 


(4.5) 


Ik]  # 

and  let  (U  }  be  the  corresponding  independent  process  uniform 

k  k  . 

in  (-2  A,  2  A)  which  consists  of  independent  and  identically 

(k)  (k) 

distributed  random  variables.  We  call  the  time  series  ,...,U^ 

(  k) 

the  U  -curve  which  we  introduce  as  above  in  order  to  create 

(k) 

graphical  features.  Let  C  be  the  number  of  crossings  of  the 

U(k)-curve  by  , .  . .  , VkZN .  Then  from  (2.7) 


(k) 

v  (1)  -  22kA2  1  -  2EAC  >  k-0  1  2 

YVkZ(1>  '  2  A  1  FPI  *  k-0,1,2,.... 


(4.6) 


where  C^03  is  taken  as  C^. 
Observe  that 


Y  k  (1) 
VKZ 


■  -P- 


(0)  + 


2kur 


y7(l)  -  +...+  (-1)  Y7(k+1)  (4.7) 


so  that  Yz(k+1)  can  from  (4.6),  (4.7)  be  expressed  as  a  function 
of  E(C(k>),  k=0,l, . . . ,k,  and  E(C2>.  We  see  that  the  C(k),  the 
number  of  U^k3 -curve  crossings  by  VkZt>  t=l»...»N,  together  with 
C2  completely  determine  the  covariance  function  of  a  stationary 


bounded  process.  We  call  the  C 


the  higher  order  U  -curve 


i n  analogy  with  the  higher  order  IT  -curve  crossings  we  can 
fine  the  higher  order  axis  crossings  by  7kZt.  More  precisely 
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let  {Z^ }  be  a  stationary  process,  not  necessarily  bounded,  and 


(k) 


define  a  binary  process  {X^  }  by 


,(k)  . 


k-  1 

1,  7  Zt>0 


0,  vk-1<o 


,  k=l,2 , . . . 


(4.8) 


Given  a  time  series  Z^,...,Z^  we  define 


Dk  =  2IXtk)  '  2Ixtk)xt-l  "  (x[k>+XMk)> 


(4.9) 


k  k 

This  is  the  number  of  axis  crossings  by  7  Z^,...,7  Z^.  But  then 
D-^  is  the  number  of  axis  crossings  by  the  original  series,  and 
apart  from  end  effects,  Dj  is  the  number  of  local  maxima  and 
minima,  D3  is  the  number  of  inflection  points,  etc.  Thus  the 
first  few  D^'s  correspond  to  conspicuous  features  in  a  time 
series.  A  natural  question  to  ask  is  whether  features  which  are 
not  that  conspicuous  or  not  at  all  for  that  matter  are  still  use¬ 
ful.  Before  considering  this  question  we  note  that  results 
similar  to  (4.6)  can  be  obtained  for  the  D^’s  in  the  Gaussian 
case  without  reference  to  boundedness. 

Wow  going  back  to  our  question,  we  note  that  { Z admits  a 
spectral  representation  with  respect  to  a  process  of  orthogonal 
increments  {£(  A) 


Z 


t 


(  —  IT  ,  IT  3 


eltAdSQ). 


(4.10) 
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It  is  convenient  to  give  an  answer  to  our  question  via  (4.10). 
Observe  that 


Corr(VkZt>VkZt_j)  -  (-1)3  ,  k  +  ».  (4.14) 

This  means  that  on  each  finite  time  (discrete)  interval  x£k) 
xor  uge  k  tends  to  consist  of  binary  strings  in  which  a  0 
s  followed  by  a  1  and  vice  versa.  In  fact  it  was  recently  shown 
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in  Kedem  and  Slud  (1980)  that  we  actually  have  weak  convergence 


of  {X<k)} 


{X^  =>  {...010101...}  >  k  -  », 


(4.15) 


where  the  O'th  coordinate  is  either  0  or  1  with  probability 
i.  Whence  in  this  sense  the  Dk's  which  count  the  number  of 
symbol  changes  in  finite  records  provide  less  and  less  information 
as  k  increases.  In  fact  numerous  simulations  show  that  out  of 
N-l  possible  symbol  changes  about  80%  are  achieved  already  by 
D-^g.  Hence  only  Dk  for  rather  low  k  are  useful.  (4.15)  is 
called  the  Higher  Order  Crossings  Theorem. 

An  important  application  of  the  higher  order  crossings  Dk's 
is  in  the  discrimination  of  time  series.  For  this  purpose  we 
make  use  of  another  .consequence  of  the  Higher  Order  Crossings 
Theorem.  It  can  be  shown  that  for  long  and  even  moderate  records 
lengths  the  Dk  actually  increase!  This  motivates  the  statistic 


.2  _  ?  (VEV 

*N  "  E4. 

k=l  k 


(4.16) 


where 


Dx  ,  k  =  1 


Ak  =  |Dk  "  Dk-1  *  k  =  2’**  * ,K  “  1 


(N-l)  -  Dk_1  ,  k  =  K 
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so  that  for  sufficiently  large  N  2  0  and  =  N-l. 

2 

Extensive  simulations  indicate  that  has  an  extremely  robust 

distribution  which  varies  only  mildly  from  process  to  process. 

This  statistic  has  proven  useful  in  many  cases  and  we  shall  report 
more  on  its  applications  in  the  near  future  elsewhere.  It  should 

be  noted  that  for  Gaussian  processes  the  expected  values  EA^ 
can  be  computed  exactly. 
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5.  A  REMARK  ON  THE  CASE  WHEN  MOMENTS  DO  NOT  EXIST 

One  of  the  great  advantages  of  graphical  features  such  as 
axis  crossings  of  stationary  sequences  is  that  regardless  of  whether 
moments  are  finite  or  not  or  whether  they  exist  at  all,  the  number 
of  axis  crossings  by  a  series  of  length  N  has  moments  of  all 
orders.  In  such  cases  the  axis  crossings  have  a  strong  case  for 
their  use  in  inference.  To  bring  a  concrete  example,  consider 
the  strictly  stationary  first  order  autoregressive  Cauchy  process 

Zt  =  *Zt-l  +  Ut  5  t=0»±l>---  (5-D 

where  |  4>  |  <  1 ,  and  the  u^  are  independent  Cauchy  random  vari¬ 
ables  with  characteristic  function  e~ ^ I  ♦  I  ^  I  s  I  .  Then  {Z^_  > 
is  a  strictly  stationary  process  with  Cauchy  marginals  having  the 
standard  characteristic  function  e“ I s L  In  the  usual  situation 
when  2nd  order  moments  of  exist,  0  is  the  correlation  be¬ 
tween  Z^  and  so  that  realizations  appear  either  "smooth" 

or  oscillatory  depending  on  the  sign  of  0.  In  the  Cauchy  case 
<p  is  no  longer  a  correlation  but  still  the  degree  of  oscillation 
depends  on  its  sign!  This  means  that  <j>  is  essentially  a  function 
of  the  degree  of  oscillation  in  the  process  or  equivalently  the 
number  of  axis  crossings.  This  can  also  be  seen  from  Table  1 
which  is  the  result  of  a  simulation  in  which  (5.1)  was  generated 
for  different  values  of  tp  and  the  corresponding  numbers  of  axis 
crossings  in  series  of  length  N=1000  were  recorded.  It  is  inter¬ 
esting  to  observe  that 
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1.  AXIS  CROSSINGS  BY  CAUCHY  SERIES  (5.1)  OF  LENGTH  1000 
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by  comparison  with  previous  results  in  the  Gaussian  case,  the 
Cauchy  series  yields  fewer  axis  crossings  for  positive  <f>  but 
more  crossings  for  negative  <j>.  In  addition,  it  was  shown  in 
Kedem  (1980a)  that  in  the  Gaussian  case  the  estimate  cos(  irD^/ (N-l)  )  , 
where  stands  for  the  number  of  axis  crossings,  is  a  remarkably 

good  estimate.  Taking  these  observations  into  account  we  suggest 
for  the  Cauchy  case  the  estimate 


cos 


/TTD1(l+a)\ 


N-l 


I 


(5.2) 


where  a  is  a  correction  factor  which  has  the  same  sign  as  <(>. 
a  =  0  corresponds  to  the  Gaussian  case.  Table  1  gives  some 

A 

<J>a  for  various  a.  Obviously  a  increases  and  decreases  with 
<{> .  We  intend  to  investigate  this  estimate  in  a  future  study. 
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6.  FEATURES  AND  PROBABILITIES 

We  have  seen  above  that  the  number  of  symbol  changes  in  a 
clipped  or  randomly  clipped  binary  series  is  a  useful  quantity 
which  can  and  should  be  used  in  inference.  Similarly,  runs  of 
various  lengths  and  types  are  also  important  as  they  summarily 
portray  useful  graphical  information  about  the  original  series. 
From  these  various  types  of  runs  or  subsequences,  we  can  actually 
compute  various  probabilities  of  interest.  Specifically,  we  will 
briefly  outline  how  to  obtain  the  asymptotic  distribution  of  the 
maximum  in  a  stationary  series  provided  certain  conditions  are 
assumed.  But  first  we  define  a  useful  feature  called  a  "unit." 

An  m'th  order  unit  is  a  binary  sequence  which  starts  with  a 
1,  ends  with  m  separating  0's  (if  needed  to  separate  it 
from  other  units)  and  in  which  each  0-run,  if  not  an  end  run, 
consists  of  at  most  m-1  0's.  Note  that  the  length  of  1-runs 

is  not  restricted.  For  example  in  the  binary  series 


There  are  4  unit  of  order  3  and  7  units  of  order  2  and  10  units  of 
first  order.  If  3  is  the  fjighest  order  under  consideration,  then 
there  are  9  0’s  which  do  not  Jelong  to  any  unit  and  we  call  them 


the  fre' 


Thus,  if  binary  sequences  are  perceived  in  terms 


of  uni , s  then  the  information  in  such  sequences  is  neatly  summarized. 
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Now,  let  be  any  stationary  series  of  length  N  and 

clip  the  series  at  a  certain  level.  This  yields  a  binary  series 
X. .  Let  n. .  ,  be  the  frequency  of  ij...k  in  the  binary 

x  13  •  •  •  -K 

series,  and  let  S  be  the  number  of  l's.  Assume  now  that  the 
highest  order  unit  is  m.  Then  the  number  of  m'th  order  units 
is 


s  -  nn  -  n1Q1  -  n10Q1 


(6.1) 


and  the  number  of  free  0's  is 

(N-s)  -  m(  #  of  m'th  order  units  -1) 


-  (m-2)n. 


- . . . "  n 


101' 


(m-l)n 


(6.2) 


Assume  that  as  the  level  at  which  the  series  is  clipped,  the 

binary  series  displays  an  m-th  order  Markov  dependence  where  m 
ma>  be  even  very  large.  Observe  that  as  the  level  increases  the 
l's  become  rare.  It  can  be  shown  then  under  some  conditions 
(Kedem  (1980b)  that  for  a  high  level  and  large  N 

#  of  permutations  of  the  m'th 
order  units  with  the  free  0's 

m  m+1 


n-s-m 


(6.3) 


Pr(S=s)  ~ 


.  1  •  •  •  X  .  __ 

t  t-1  t-m 


Pr(x„ 


I  Xt-l=xt-l *  *  *  * ,Xt-m=xt-m) ' 


where 
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Now  let  P10Q_>0  ♦  0  such  that  NP100>_Q  =  0  is  fixed.  Then 
it  is  not  difficult  to  see  that 

Pr ( S  =  s )  ■*  e"8  0S/s!  . 

Therefore  for  large  N 

Pr(max  Z  <  a  high  level  u)  ~  e-E(#  exceedances  above  level  u) 
l<t<N  t 

If  the  I's  tend  to  cluster,  a  similar  argument  replaces  the  number 
of  exceedances  of  level  u  by  the  number  of  upcrossings  of 


level  u. 
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7 .  SUMMARY 

We  have  illustrated  briefly  the  connection  between  visual 
quantities  such  as  crossings  of  fixed  and  random  levels  and  the 
covariance  function  of  a  stationary  process,  some  parameters  of 
interest  and  the  distribution  of  the  maximum  in  stationary  series. 
In  particular  we  have  focused  on  some  graphical  features  of  time 
series  which  contain  a  great  deal  of  information  useful  in  in¬ 
ference.  For  this  purpose  we  have  created  features  by  clipping 
at  random  levels  and  by  changing  the  position  of  fixed  levels  in 
a  controlled  manner.  Features  such  as  peaks,  troughs,  inflection 
points,  axis  crossings  by  the  k’th  difference  of  a  stationary 
process,  etc.,  are  useful  up  to  a  point.  This  is  the  subject  of 
the  Higher  Order  Crossings  Theorem. 
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